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Abstract 

Researchers in the field of epigenomics are developing more nuanced understandings of biological complexity, and 
exploring the multiple pathways that lead to phenotypic expression. The concept of degeneracy — referring to the 
multiple pathways that a system recruits to achieve functional plasticity — is an important conceptual accompani- 
ment to the growing body of knowledge in epigenomics. Distinct from degradation, redundancy and dilapidation; de- 
generacy refers to the plasticity of traits whose function overlaps in some environments, but diverges in others. 
While a redundant system is composed of repeated identical elements performing the same function, a degenerate 
system is composed of different elements performing similar or overlapping functions. Here, we describe the degen- 
erate structure of gene regulatory systems from the basic genetic code to flexible epigenomic modifications, and dis- 
cuss how these structural features have contributed to organism complexity, robustness, plasticity and evolvability. 

Keywords: epigenetic code; pluripotentiality; robustness; redundancy; DNA methylation; histone modifications; social insect; 
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INTRODUCTION 

The ability of natural selection to give rise to a large 
number of nonidentical structures capable of pro- 
ducing similar functions appears to increase both 
the robustness of biological networks and their 
adaptability to unforeseen environments by provid- 
ing them with a large repertoire of alternative func- 
tional interactions. Tononi and Edelman [1]. 

Biological systems can do a lot with a little. 
Consider, for example, the number of protein- 
coding genes in various organisms with distinct 
evolutionary histories. The human genome has to 
provide developmental cues for generating distinct 
tissues, organs, hundreds of cell types, and for build- 
ing and wiring a centralized brain with 100 billion 
neurons and 100 trillion synaptic connections. 
And yet the human genome encodes nearly the 
same number of genes as does the genome of 



Caenorhabditis elegans, a tiny eutelic nematode with 
only 959 cells, including just 302 neurons located 
in dispersed ganglia [2]. Data overwhelmingly indi- 
cate there is no simple relationship between gene 
number, neuron number and evident morphological 
and behavioural complexities of animals in different 
phyla [3]. Indeed, organisms with highly advanced 
brains and complex behaviours, such as primates, 
have fewer genes than the water flea Daphniapulex 
(22 000 and 31 000, respectively). Consequently for- 
mulating a genomic interpretation of the develop- 
ment and evolution of new biological forms is 
proving far more difficult than previously 
anticipated. 

Organisms can employ many different molecular 
systems to achieve the same end result, and many 
different designs, such as diverse gastrulation patterns, 
can be manufactured during embryogenesis to solve 
the same developmental problem. Frequently, to 
evolve new morphological or behavioural forms 
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evolution has had to reuse and adapt existing elem- 
ents by utilizing the inherited genomic sequences in 
a new context-dependent manner. 

A multi-level regulatory network consisting of 
such mechanisms as modular utilization of protein 
domains, alternative splicing and epigenomic modi- 
fications of DNA has been the driving force behind 
the wide radiation, rapid evolution and evolutionary- 
success of eukaryotic organisms. Here, we argue that 
the degenerate organization of the genome and epi- 
genome is a key feature enabling the evolutionary- 
process to create new forms. A degenerate system 
contains multiple structurally distinct elements per- 
forming similar functions (Box I and II; Figure 1). 
Ordinarily degeneracy promotes stability in a 
self-organizing system, but degeneracy also allows 
elements to functionally diverge by an evolutionary 
process, and become exapted to a new function 
without any loss of coherency to the original system. 

Degeneracy greatly increases the capacity of a 
limited and fixed number of genes to generate 

Box I 

Definitions 

Isomorphic Structures that are identical. For example, the 

special molds at the Billund factory ensure 
that each 2x2 Lego brick (Design ID 3003) is 
isomorphic 

Isofunctional Performing the identical function. For example, a 

pen, a pencil and a quill can all perform the 
same function 

Heteromorphic Structurally different elements. For example, a 
pen and a quill are heteromorphic 

Degeneracy The structural variation that underpins functional 

plasticity. For example, pens and pencils can 
perform the same function with respect to 
context. On earth, both pens and pencils can 
be used to write. Without structural modifica- 
tion, however, only pencils work in space 



Box II 




Structure 


Function 


Context 


(Unspecified) 


Many 


Many 


Independent 


Redundancy 


One 


One 


Independent 


Degeneracy 


Many 


One 


Dependent 


Pluripotency 


One 


Many 


Dependent 



Redundant components have a structure -function ratio of one-to-one 
irrespective of context. Degenerate components have a structure-to- 
function ratio of many-to-one. Pluripotential elements change function 
according to context. 



morphological and behavioural complexities. Here, 
we review the history of the concept of biological 
degeneracy, and discuss the importance of degener- 
ate organization for the function and evolution of 
the genome and epigenome. 



BIOLOGICAL DEGENERACY 

Degeneracy is an unfortunate word for a useful con- 
cept. In contemporary biological science, degeneracy 
refers to structural variation underlying functional 
plasticity. Systems with degenerate components 
have a structure-to-function ratio of many-to-one 
(Box I and II). Degeneracy is a distributed property 
of complex adaptive systems that in many circles of 
science has been hidden in plain sight [4], commonly 
overlooked because of a reductionist bias [5,6], and 
ignored because the term itself is misleading [7]. 
Although degeneracy is known to be a characteristic 
of genetic codes [8,9], immune systems [10], respira- 
tory network regulation of blood-gas homoeostasis 
[11], human movement [12,13], cognitive neuro- 
anatomy [14—16], population dynamics [17,18], and 
as a conceptual tool offered the final solution to the 
coding problem of DNA [19], the term is still not 
well comprehended in evolutionary biology. 

Commonly, structurally different but functionally 
similar degenerate components are often mislabelled 
as redundant, a term that actually refers to identical 
elements performing the same function (Box I and 
II). The biological concept of degeneracy suffers 
from association with a value-laden use outside of 
the sciences that has gained a host of negative asso- 
ciations [7,20,21]. 

A living system exhibits degeneracy if it contains 
multiple different structures that can perform a simi- 
lar function. The ability to perform the same tasks by 
different mechanisms prevents unbearable fluctu- 
ations and the propagation of cascading failures in a 
system [22]. A dynamic self-organizing system must 
also strike a compromise between the over-stabiliza- 
tion of networks and the noise within and between 
various networks, as too much specificity reduces 
adaptability [5], whereas some reduced specialization 
provides a capacity for plasticity and adaptability. It is 
very common in biological systems for molecules or 
cells to recognize a range of targets, and for target 
ranges to partially overlap between different elem- 
ents. These molecules and cells are called degenerate. 
Degeneracy operates at multiple levels of complex- 
ity. Dissimilar genes can produce the same 
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Many structurally distinct factors can influence gene activities yielding similar effects 
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Figure I: A simplified diagram highlighting the structural complexity of gene regulatory elements in eukaryotic 
organisms. Left-right arrows indicate bidirectional communications, feed-back and feed-forward loops. For more 
details, see the main text and selected recent articles [31-36,45,46,54-65]. 



developmental output; unrelated populations of 
neurons can subserve the same behavioural task; 
and different patterns of muscular contraction can 
yield similar movements. In biological systems, the 
loss of a component can be compensated by redun- 
dant elements (the presence of other identical com- 
ponents) or by degenerate elements (structurally 
different components that perform the same function 
with respect to context). 

Degeneracy is a phenomenon whereby different 
structural permutations recurrently lead to similar 
end results, but with changes in context, the elem- 
ents of a system might change function in which case 
they are described as pluripotential with a structure- 
to-function ratio of one-to-many (Box I and II). 
Pluripotentiality is the complement of degeneracy 
[16]. In the immune system, for instance, degeneracy 
of antigen receptors enables any single epitope to 
activate many different lymphocyte clones, and sim- 
ultaneously any single lymphocyte clone is able to 
recognize many different epitopes [23]. Tieri et al. 
[24], borrowing a term from Csete and Doyle [25], 



refer to the overlap between degeneracy and plur- 
ipotentiality as a bowtie. Many inputs funnel into a 
thin knot of interlocking networks and subsequently 
many corresponding outputs fan out. The prime ex- 
ample of a bowtie is the transcription and translation 
of DNA to proteins. A large variety of genes produce 
a few universal polymerase modules — the 'knot' of 
the bowtie — and a large variety of proteins result 
[26]. 

Degeneracy is not limited to the internal struc- 
tures of an organism, but may also occur between 
internal structures and environmental resources. 
Deacon [27] gives the example of endogenous ascor- 
bic acid synthesis (vitamin C) existent among some 
primate lineages and missing in others. All prosimians 
except Tarsiers synthesize ascorbic acid endogen- 
ously but anthropoid primates have lost this function. 
A shift in diet among anthropoid ancestors has led to 
a reliance on acquiring ascorbic acid from dietary 
sources such as fruit. Once food sources containing 
ascorbic acid were available in reliable and plentiful 
quantities, the gene responsible for endogenous 
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ascorbic acid synthesis was no longer needed, became 
selectively neutral, and was free to accumulate mu- 
tations without deleterious outcomes for the organ- 
ism. Mutational variants were no longer eliminated 
because exogenous ascorbic acid became regularly 
available. Selection began to operate not simply on 
genes for ascorbic acid synthesis but also across a 
distributed network of sensory biases, behavioural 
inclinations and digestive-metabolic mechanisms 
that increased the probability of obtaining ascorbic 
acid from the environment. In this way, within cer- 
tain degrees of freedom, if there is degeneracy be- 
tween environmental and genomic factors, then 
selection can result in an offloading of function 
from the genome to the environment, or a potential 
divergence of the environmental and genomic elem- 
ents leading to the random exploration of adjacent 
function space. 

DEGENERACY IN THE GENOME 
AND EPIGENOME 

Degeneracy is a key organizational feature of our 
genetic code [8,9]. All but two amino acids are 
encoded by more than one triplet codon, with 
each set of codons specific only for one amino 
acid. In total, there are 64 different codon combin- 
ations or 'ciphers' in the degenerate genetic code for 
just 23 amino acids. This evolutionary invention 
provides several adaptive benefits. For example, bac- 
teria can adapt protein synthesis to a limited avail- 
ability of certain amino acids, by taking advantage of 
'degeneracy lifting', a process that allows degenerate 
systems to display a variety of behaviours, depending 
on environmental settings. Nutritional perturbations 
lift the degeneracy of the genetic code by splitting 
codon families into robust and sensitive synonymous 
codons that results in 100-fold higher rate of protein 
synthesis associated with robust codons [9]. 
However, as the structural complexity of biological 
systems increases and connectivity of multiple parts 
becomes non-linear, new molecular strategies are 
needed to differentially utilize the genome in distinct 
cell lines of eukaryotic organisms, to coordinate de- 
velopment and cellular identity and to structure and 
stabilize genome— environment interactions. 

Progress in epigenomics has shown that the 
genome is regulated by a multifaceted and highly 
degenerate system involving families of transcription 
factors interacting with various chemical modifica- 
tions of genomic DNA or its packaging proteins 



(Figure 1). While these do not alter the underlying 
genetic code, they do establish regulatory marks 
along the genome that operate with general tran- 
scription factors to coordinate structured gene ex- 
pression. The epigenetic system is a self-organizing 
regulatory level of organization that operates above 
the genome and provides the high level of flexibility 
needed for coordinated and context-dependent ex- 
pression of multitudes of genes. 

The modern era of epigenetics originated with 
the seminal works of Conrad Waddington who in 
1939 introduced the concept of 'epigenotype', a 
quality whose mode of impact was over the classical 
genotype [28]. Waddington used this descriptor in 
the context of developmental canalization to explain 
how a particular organ is produced by both the 
genotype and the epigenotype reacting with the ex- 
ternal environment. Epigenetic research moved from 
a genetics-based, to a methylation-based, to a CpG 
island-based, and more recently to genome-wide 
methylomics, initiated by the seminal articles of 
Riggs and associates [29,30]. One often used defin- 
ition of epigenetics is presented by these authors as 
'the study of mitotically and/ or meiotically heritable 
changes in gene function that cannot be explained by 
changes in DNA sequence' [30]. 

One mechanism for epigenetic inheritance is via 
cytosine methylation whereby gene expression can 
be modulated by adding or removing methyl groups 
to the DNA. This process is generally set as part of 
the normal genetic program, but recent studies have 
shown that methyl tags can be reset by a variety of 
factors including viral infection, drug treatment or 
even simple components of a regular diet [31—36]. 
Another important epigenetic mechanism operates at 
the level of chromatin via histone modifications 
[36,37]. However, at least theoretically, the transmis- 
sion of epigenetic information might be mediated 
also by other cellular macromolecules including 
RNA, proteins and sugars or even lipids and cellular 
membranes [36]. Experimental evidence suggests 
that some of these DNA and histone modifications 
are responsive to both internal and external cues and 
can be rapidly activated in a context-dependent 
manner [31—38]. Acting alone or in combination, 
epigenomic modifications have both short-term 
and long-term effects on gene regulatory cascades, 
on DNA replication and chromatin conformation. 
This vast array of biochemical decorations of gen- 
omic sequences acts as a highly flexible epigenetic 
code that has the capacity to regulate not only 
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individual genes but also entire networks. Adjusting 
topology of cellular networks by genome-wide 
reprogramming is a way of re-interpreting genetic 
hardware to generate contextual epigenetic states. 
Some of the epigenomic modifications underlying 
a given epigenetic signature become stable and can 
be maintained during the life of an organism as a 
durable phenotype. However, an environmental 
shift can easily enforce a new round of programming 
leading to a different more suitable phenotypic out- 
come [31—38]. In addition to plants and animals, a 
similar interplay of environmental and epigenetic 
contexts has also been reported in more elementary 
pathways, such as the cell autonomous DNA damage 
response in yeast [39]. 

Conceptualizing the epigenetic system as degen- 
erate is useful for mapping the complex, dynamic, 
self-organizing pathways of biological activity. 
Epigenetic systems exhibit degeneracy since several 
distinct mechanisms used to control gene regulation 
can yield similar end results. For example, genes can 
be turned on and off by the action of transcription 
factors or by promoter methylation (or chromatin 
configuration) [40—44]. Alternative splicing is con- 
trolled by multifactorial cellular machinery including 
DNA methylation and histone positioning, acting 
together or as individual epigenomic modifiers 
[45—47]. Oxidation of 5-methyl cytosine (5Mc) to 
5-hydroxy mC (5hmC) is considered the main path- 
way in removing the methyl tags from the genome, 
but in the brain increased levels of 5hmC in gene 
bodies have been shown to correlate with transcrip- 
tional activation [48] . Within the neuronal function- 
related genes, gain of 5hmC is accompanied by loss 
of H3K27me3, an important histone mark that is 
catalysed and maintained by Polycomb Repressive 
Complex 2 (a histone methyl- transferase, [48]). 
Although H3K27me3 has been implicated in tran- 
scriptional silencing, its localization either in gene 
bodies or around transcription start sites suggests 
that its mode of action is degenerate with other epi- 
genetic elements [49] . More evidence suggesting that 
the multifactorial epigenetic code is degenerate 
emerges from high-throughput epigenomic data. 
By integrating genome -wide profiles for a histone 
mark (H3K36me3) with two sets of transcriptional 
RNAseq data, Althammer etal. [50] found that epi- 
genetic regulation of gene activities involves multiple 
intragenic regions, specifically the first exon, first 
intron and a region downstream of the polyadenyla- 
tion site. Furthermore, the histone modifications 



appear to be highly heterogeneous and their regula- 
tory effects depend on the structural features of pro- 
moters and genes (GC-content and presence of 
introns). Their results clearly show that the histone 
code is highly degenerate with different groups of 
attributes from different genie regions leading to 
similar regulatory events. 

Another striking example of gene activity regula- 
tion via a degenerate epigenomic mechanism is silen- 
cing of the ribosomal RNA (rRNA) gene cluster by 
non-coding RNA [51]. In this model, an RNA mol- 
ecule, complementary to the rRNA promoter me- 
diates de novo CpG methylation of rRNA genes by 
interacting with the transcription factor TTF-1 and 
forming a DNA:RNA triplex, which recruits the 
DNA methyltransferase DNMT3b. Thus, RNA- 
dependent DNA methylation acting together with 
histone modifications might be required to select- 
ively shut down a subset of rRNA genes by forming 
heterochromatic states that have been shown to be 
associated with inactive rRNA genes. 

At least some of these epigenomic regulatory 
events appear to be bidirectional. De Almeida et al. 
[52] have recently shown that gene splicing con- 
trolled by trimethylation of histone H3 Lys3 can in- 
fluence the methylation of H3 Lys3 by recruiting, via 
an unknown mechanism, methyltransferase HYPB. 
This process, linked to genes' structural features (first 
intron length), not only actively modulates histone 
H3 Lys 3 modifications but also improves quality of 
intron splicing. Although introns are demarcated by 
short sequence motifs, this 'splicing code' is poorly 
conserved and lacks adequate information to recog- 
nize correct splicing sites. The emerging view is that 
a proper choice of splicing events is regulated by a 
coordinated action of many components including 
RNA polymerase II, ribonucleoprotein particles, 
hundreds of auxiliary proteins, chromatin factors, 
DNA methylation and histone modifications [46,52]. 

Elaborate manners of regulation, such as interact- 
ing DNA-methylation and histone modification sys- 
tems, are likely to be the hallmarks of the epigenetic 
code. The combinatorial utilization of flexible epi- 
genomic modifications, together with gene splicing, 
post-translation protein modification and RNA edit- 
ing has ample potential to generate functional diver- 
sity from a fixed genotype (Figure 1). In contrast to 
only 64 codon combinations in the genetic code, the 
number of possible 'ciphers' in the epigenetic code 
or the level of degeneracy could be many orders of 
magnitude higher [53]. This high level of degeneracy 
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provides virtually unlimited coding potential to 
ensure both developmental buffering and flexibility 
to deal with random external factors. 



DEGENERACY PROVIDES BOTH 
STABILITY AND PLASTICITY TO 
THE EPIGENOME 

The idea that epigenetic mechanisms can generate 
similar phenotypic outcomes by adjusting expression 
of different genes is not new. Seventy years ago, 
Waddington argued that epigenetic processes are 
part of developmental canalization, or buffering of 
the genotype against external perturbation [28]. He 
noted that developmental reactions 'are adjusted to 
bring about one definite end result regardless of 
minor conditions, hence the remarkable constancy 
of the wild type' [28]. Although Waddington has 
never used the term degeneracy, his pioneering 
ideas brought into focus the flexibility of cellular 
responses to external stimuli and their ability to 
select 'a suitable genetically controlled reactivity in 
the organism'. He illustrated his concept of genotype 
buffering using the phenomenon of 'phenocopying', 
whereby the phenotypic effect of a mutation can also 
be created by non-genetic means. For example, in 
Drosophila, phenocopies for both pigmentation and 
behaviour of a mutation known as 'yellow' can be 
produced by a number of treatments, including 
pharmacological inhibitors of tyrosine hydroxylase 
or by growing larvae on silver nitrate [66,67]. 
Another notable example of phenotypic buffering 
against the effect of mutations is the combined 
action of environmental stress and stochastic expres- 
sion of protective genes encoding chaperone proteins 
[68]. Molecular chaperones or heat-shock proteins 
are considered important drivers of evolutionary 
change because of their role in folding or assembling 
macromolecular structures that is highly dependent 
on receiving specific internal or external signals [69]. 
In C. elegans, a potentially lethal outcome of a muta- 
tion in the transcription factor lin-29 can be signifi- 
cantly weakened after a mild heat shock delivered 
during early stages of development [69]. This pro- 
tective effect of stress has been correlated with higher 
stochastic chaperone expression. However, lower re- 
productive fitness associated with higher abundance 
of chaperones suggests that at the whole population 
level, inter-individual variability in the expression of 
these proteins provides a better survival strategy in 
erratic environmental conditions. Encapsulating the 



heterogeneous construction of phenotypic buffering, 
the concept of degeneracy highlights the array of 
pathways that can lead to a similar phenotypic 
endpoint. 

DEGENERACY OF THE 
EPIGENOME, PHENOTYPIC 
POLYMORPHISM AND 
EVOLVABILITY 

Permutations of structure that recurrently lead to the 
same end-point provide a selective repertoire of 
flexible mechanisms with variable specificities. With 
these degenerate mechanisms of organization, 
selection is able to act differentially upon two or 
more structurally distinct traits that can fulfil the 
same role within specific contexts. Selective systems 
have a great number of elements that each respond at 
varying specificities to incoming signals [70]. In 
other words, environmental influences can act 
upon specific elements as well as populations of 
'degenerately responding, possibly less specific, elem- 
ents' [71]. 

Degeneracy means that natural selection acts not 
only in evolution but also in ontogenesis by sorting 
stochastic degenerate interactions at multiple levels of 
complexity [72] . Mathematical modelling has shown 
that competition for resources by degenerate reper- 
toires is a mechanism of self-organization that can 
also explain evolutionary branching [73] and possibly 
even speciation [18]. Degeneracy can account for the 
divergence of traits into subpopulations and the 
spontaneous self-structuring of degenerate reper- 
toires into distributions inconsistent with the avail- 
able resources in the environment [70,71]. In many 
ways, the inclusion of degeneracy in biological 
theory answers the call of West-Eberhard to 'change 
the way biologists think about the origins of organic 
diversity' [74]. 

New directions in the evolution of an organism 
start with a population of variably responsive, devel- 
opmentally plastic mechanisms [75]. As a mechanism 
of intraspecific variation, degeneracy contributes to 
evolvability through processes that are not immedi- 
ately adaptive. As a result of competition for re- 
sources, the interaction of degenerate repertoires 
with selective constraints gives rise to self-organiza- 
tion [70] . Mathematical modelling has revealed that a 
population of degenerate systems spontaneously 
manifest self-organization patterns that do not 
mirror incoming signals or the distribution of 
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resources [70,71]. In other words, degeneracy offers 
an alternative to adaptation in explaining the features 
of an organism. Not every trait is an adaptation to 
the environment that it currently inhabits, or put 
another way, not all traits were necessarily selected 
for their current function. Incorporating degeneracy 
into the study of biological systems brings attention 
to the many ways that variation can arise while main- 
taining organism function. 

Degenerate epigenomic regulatory systems can 
diverge through accumulation of random mutation 
without compromising the phenotype of the organ- 
ism. This makes possible the evolution of new 
genome regulatory pathways and potentially new 
developmental programs that utilize existing coding 
sequences in new combinations. Unlikely as it may 
sound growing evidence suggests that such changes 
more frequently underlie the evolution of new 
behavioural or morphological structures than the 
evolution of 'new' coding genes. 

A particularly pertinent example is the evolution 
of the human brain. Humans and chimpanzees have 
almost identical genomes and their brains are ana- 
tomically very similar, except that human brain is 
about three times larger than that of chimpanzees 
[76]. This significant volumetric expansion of the 
hominid brain began only 2 million years ago and 
posed a major challenge for rapid development of 
new regulatory circuits. It has been argued that a 
few genomic regions called human accelerated re- 
gions (HARs) contain clues to the enhanced changes 
associated with the evolution of human brains. 
HAR1, in particular, has been suggested to be im- 
portant for brain evolution [76]. HAR1 is highly 
evolutionarily conserved among mammals, but has 
diverged rapidly in humans since the last common 
ancestor with chimpanzees [76]. Although HAR1 is 
essentially deprived of protein coding genes, it con- 
tains the so-called non-coding RNA genes including 
HAR1F that is expressed in the mammalian neocor- 
tex and is now a candidate for being part of a process 
that brought about innovative structural modifica- 
tions in the human brain. Another interesting 
example of an RNA-coding gene implicated in 
human evolution is mir941; a micro RNA that 
emerged de novo in the human lineage from an un- 
stable tandem repeat DNA sequence between 6 and 
1 million years ago. mir941 has been implicated in 
reorganizing gene regulatory networks controlling 
cellular differentiation and neurotransmitter signal- 
ling [77]. 



Interestingly, analogous results are emerging from 
studies in invertebrates. The brain architecture of 
social honey bees (Apis mellifera) and solitary flies 
(e.g. Drosophilidae) is virtually identical, but the bees 
have approximately 10 times more neurons [78,79] 
in the cephalic ganglia, a richer behavioural reper- 
toire and a unique symbolic 'language'. Like in 
humans, the evolutionary behavioural novelties in 
the honey bees are not associated with an increased 
gene number. In comparison with other animal spe- 
cies, insects have a relatively small number of genes, 
ranging from ~15 000 to 17 000 [2]. Furthermore, 
non-coding honey bee genes have also been found 
to be important for brain functions. The expression 
of a transcription factor-like RNA-coding gene 
kakusei is associated with an increased neuronal ac- 
tivity in a subset of neurons in brains of foraging bees 
communicating the location of food resources via the 
dance language' [80]. Another non-coding gene 
Nb-1 has been implicated in modulating octopamine 
and juvenile hormone release during a worker bee 
behavioural transition from nursing to foraging [81]. 
One implication of these findings is that the existing 
protein and/or gene networks can be rewired by 
new epigenetic regulatory circuits to generate new 
forms of communication and social behaviour with- 
out inventing novel protein-coding genes. Instead, 
fast evolving RNA molecules of various types and 
sizes appear to control the differential recruitment of 
a hierarchy of general chromatin and DNA modify- 
ing complexes to specific loci during differentiation 
and development. This multilayer process creates 
multiple functional versions of the same genome, 
or epigenomes that have that capacity to interpret 
the genomic information in a context-dependent 
manner. 

The evolution of new developmental programs 
through changes in epigenetic regulation has enabled 
the evolution of the highly complex and morpho- 
logically specialized social insects. Diagnostic of the 
advanced social insects is the occurrence in a colony 
of different morphs specialized for reproductive roles 
and non-reproductive colony support roles. In the 
eusocial hymenoptera (e.g. ants, bees and wasps) 
these are the queen and worker castes. New evi- 
dence is revealing how these different phenotypes 
result from different patterns of epigenetic regulation 
of the genome. In this way, the action of selection 
on degenerate epigenetic systems has yielded a whole 
new level of biological organization that currently 
dominates most terrestrial ecosystems. 
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Two phenotypically distinct female honey bees 
(A. mellifera), queens and workers, are encoded by 
one genome whose mode of expression can be con- 
ditionally modulated by nutritional input [82—85]. 
Feeding a complex diet known as royal jelly to a 
growing female larva inhibits global DNA methyla- 
tion, increases levels of juvenile hormone and cor- 
relates with changes in gene expression, which result 
in the queen phenotype. In contrast, larvae fed less- 
nutritious worker jelly develop into functionally 
sterile short-lived worker bees. However, there are 
no specific 'queen' or 'worker' genes in the Apis 
genome. During the initial critical 96 h of larval 
growth, multiple sensory and secretory systems are 
involved in receiving, processing and conveying the 
nutritional information to multilevel, interlocked 
signaling pathways. The contrasting phenotypes 
result from threshold-based processes driven by 
metabolic fluxes, hormonal changes and differential 
methylation and expression of many genes [83]. All 
these components have the capacity to respond to 
environmental change, but their combined and 
coordinated action has evolved in honey bees as a 
powerful mechanism for reprograming the entire de- 
velopmental trajectory with profound consequences 
for cellular and organismal phenotypes [82—85]. 

The implications of the honey bee findings are 2- 
fold. First, they illustrate a candid point that gene 
products are mere parameters in networks and it is 
network fluxes and network equilibria that need to 
be understood [86—88]. There is unlikely to be a 
single major hub to which a given complex pheno- 
type can be attributed, and hence why the linear 
explanations of genome-to-phenotype correlations 
have largely failed [86,87]. Indeed, network model- 
ling of putative gene interactions based on the oc- 
currence of overrepresented motifs in the upstream 
control regions and differentially expressed genes 
(DEGs) shows that worker's network is more inter- 
connected than queen's network. This suggests that 
the worker DEGs share more conserved cz's-elements 
when compared with queen DEGs [85]. 

Second, a fixed number of genes can be epigen- 
etically programmed to yield more than one organ- 
ismal outcome, suggesting that epigenomic modifiers 
have the capacity to relax evolutionary constrains on 
development. These modifiers operate by recruiting 
only a subset of an organism's gene repertoire and 
reusing it in a combinatorial manner. Differential 
epigenomic modifications of over 2000 genes 
lead to remodelling multiple sub-networks [83]. 



Interestingly, the impetus for this process comes 
from metabolic flux and is driven by the most con- 
served organic molecules: the metabolic enzymes. 
Changes in DNA methylation and post-translational 
alteration in chromatin complexes, lead to both 
silencing and activation of genes, and result in altered 
expression profiles at the transcriptomic, proteomic 
and metabolite levels. The relative contribution of 
these changes to alternate developmental trajectories 
will require additional molecular and cell biological 
data. 



CONCLUSION: DEGENERACY AS 
A DESIGN PRINCIPLE FOR 
GENOME FUNCTION 

Technological innovation is no longer a limiting 
factor in biomedical sciences. The speed and decreas- 
ing cost of second generation DNA sequencing is 
producing an unparalleled amount of raw data that 
is expected to drive major breakthroughs in both 
basic and applied research. The challenge is no 
longer in generating more data, but in converting 
them into knowledge. To avoid being distracted 
by terabases of sequencing reads and their perceived 
'functionality' we need new conceptual frameworks 
to fully comprehend the complexity of biology 
behind the genomic hardware. 

In this article, we have argued that degeneracy is a 
central organizational feature of genomes and epi- 
genomes. Epigenomic systems in particular consist 
of multiple non-identical elements with partially 
overlapping functional features. We are certainly 
not the first to recognize the importance of 
degenerate organization in the understanding of 
structure— function relations. Edelman argued that 
'. . . degeneracy implies a certain relaxation of con- 
straint during development, opening certain evolu- 
tionary possibilities in a given set of phenotypes' and 
provides 'the necessary leeway for evolutionary 
changes that would otherwise lead to blind ends or 
lethality' [89]. Here, we have gathered new and 
emerging examples from epigenetics that show 
how degeneracy contributes to the stabilization of 
phenotypes, and can allow the same functional 
phenotypic outcome to be reached by very different 
specific mechanisms. These mechanisms are not lim- 
ited to purely genomic programs, but can involve 
interactions between the genome, epigenome and 
environment. 
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We have also argued that degeneracy has been a 
critical and overlooked factor in the evolution of 
new phenotypes. Elements of a degenerate system 
are free to functionally diverge over evolutionary 
time without compromising the existing functional 
output. New random variants that are not immedi- 
ately selectively advantageous can be maintained in 
degenerate systems, and through progressive accu- 
mulation of new variations interacting with plastic 
exploration of existing gene regulatory mechanisms 
new functional pathways and phenotypes can 
emerge. We have argued that these processes could 
be a driving force in the evolution of new forms of 
physiology and behaviour. Epigenomic influences on 
global regulatory networks in honeys bees in particu- 
lar have already brought a fresh perspective to the 
study of epigenetic regulation of development and 
behaviour [34,35,90—92], and degeneracy may oper- 
ate far more broadly in facilitating the evolution of 
new levels of complexity. Deacon [27], for example, 
offered a sophisticated epistemological approach to 
understand the role of global degeneracy in the evo- 
lution of communication generally. Such ideas are a 
reminder that the genetic code is not prescriptive and 
that biological research has to be context-driven. 
Complementary organisms such as birds [27] and 
honey bees offer useful systems in which we can 
test ecologically valid hypotheses about epigenetic 
morphodynamic processes. Using honey bees and 
other social insects to study relaxed selection, 
global degeneracy and social communication is al- 
ready revealing how radical new forms of social be- 
haviour and communication have evolved [93—95], 
and could deliver a fatal sting to the idea that adap- 
tations arise through saltatory events or because they 
are immediately beneficial to a species. 

The concept of degeneracy is an important the- 
oretical tool for unpacking the heterogeneous con- 
struction of phenotypes as well as the multifarious 
variable intersecting pathways that compose biolo- 
gical systems. The reiterations of living systems are 
developmentally constructed and come into being 
through internal interactions within organisms, inter- 
actions between organisms, and interactions between 
organisms and their surroundings. Novelty and vari- 
ation arise from these interactions. The functional 
redistribution of biological activity onto a group 
of interacting organisms and their environment 
effectively offloads a degree of genetic control onto 
epigenetic processes. Degeneracy introduces a sus- 
ceptibility to developmental influences and enables 



epigenetic adaptation to the environment. In other 
words, degeneracy opens a developing organism up 
to variable causal factors, environmental modification 
and historical contingency. The availability and re- 
cruitment of functional extrasomatic resources cre- 
ates the possibility for developmental adaptation to 
the environment without severely compromising the 
integrity of the system [75]. In this situation, devel- 
opmental information is distributed throughout an 
array of internal and external components that each 
fractionally influences the ongoing dynamics of the 
system. Degeneracy is not a reductionist account of 
self-organizing, dynamical and living systems. By 
understanding biological systems as degenerate, re- 
searchers can develop a framework aimed at model- 
ling the coevolutionary and codevelopmental 
relationship between organisms and their 
environment. 
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